Reinforcement learning accounts for moody conditional cooperation behavior: experimental results

Authors

  • Yutaka Horita
  • Masanori Takezawa
  • Keigo Inukai
  • Toshimasa Kita
  • Naoki Masuda
Abstract

In social dilemma games, human participants often show conditional cooperation (CC) behavior or its variant called moody conditional cooperation (MCC), in which they tend to cooperate when many of their peers have previously cooperated. Recent computational studies showed that CC and MCC behavioral patterns could be explained by reinforcement learning. In the present study, we use a repeated multiplayer prisoner's dilemma game and the repeated public goods game played by human participants to examine whether MCC is observed across different types of games and whether reinforcement learning can explain the observed behavior. We observed MCC behavior in both games, but the MCC that we observed was different from that observed in past experiments. In the present study, whether or not a focal participant cooperated previously affected the overall level of cooperation, instead of changing the tendency of cooperation in response to cooperation of other participants in the previous time step. We found that, across different conditions, reinforcement learning models were approximately as accurate as an MCC model in describing the experimental results. Consistent with previous computational studies, the present results suggest that reinforcement learning may be a major proximate mechanism governing MCC behavior.
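
The MCC pattern reported in the abstract can be illustrated with a minimal sketch. It assumes a linear response rule in which the focal participant's previous action shifts only the intercept (the overall level of cooperation), while the slope with respect to peer cooperation is shared. The function name, the linear form, and all parameter values are hypothetical choices for illustration and are not the fitted model from the study.

```python
def mcc_cooperation_probability(frac_peers_cooperated, focal_cooperated_prev,
                                slope=0.4, intercept_after_c=0.5,
                                intercept_after_d=0.2):
    """Probability that the focal player cooperates in the current round.

    Hypothetical illustration: all default parameter values are assumptions,
    not estimates from the experiments.
    """
    if not 0.0 <= frac_peers_cooperated <= 1.0:
        raise ValueError("frac_peers_cooperated must lie in [0, 1]")
    # The focal player's previous action changes only the intercept,
    # i.e. the overall level of cooperation, not the slope.
    intercept = intercept_after_c if focal_cooperated_prev else intercept_after_d
    p = intercept + slope * frac_peers_cooperated
    # Clip to a valid probability.
    return min(1.0, max(0.0, p))


# Usage: the gap between the two curves is the same at every peer fraction.
for frac in (0.0, 0.5, 1.0):
    after_c = mcc_cooperation_probability(frac, True)
    after_d = mcc_cooperation_probability(frac, False)
    print(f"peers={frac:.1f}  after C: {after_c:.2f}  after D: {after_d:.2f}")
```

In this sketch, a previously cooperating participant is uniformly more likely to cooperate again, whereas an MCC pattern with a mood-dependent slope would require separate slope parameters for the two cases.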

Related resources

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin

Direct reciprocity, or repeated interaction, is a main mechanism to sustain cooperation under social dilemmas involving two individuals. For larger groups and networks, which are probably more relevant to understanding and engineering our society, experiments employing repeated multiplayer social dilemma games have suggested that humans often show conditional cooperation behavior and its moody ...

Confusion and Reinforcement Learning in Experimental Public Goods Games

We use a limited information environment to mimic the state of confusion in an experimental, repeated public goods game. The results show that reinforcement learning leads to dynamics similar to those observed in standard public goods games. However, closer inspection shows that individual decay of contributions in standard public goods games cannot be fully explained by reinforcement learning....

Confusion and Reinforcement Learning in Experimental Public Goods Games (University of Innsbruck Working Papers in Economics and Statistics)

We use a limited information environment to mimic the state of confusion in an experimental, repeated public goods game. The results show that reinforcement learning leads to dynamics similar to those observed in standard public goods games. However, closer inspection shows that individual decay of contributions in standard public goods games cannot be fully explained by reinforcement learning....

Learning dynamics explains human behaviour in prisoner's dilemma on networks.

Cooperative behaviour lies at the very basis of human societies, yet its evolutionary origin remains a key unsolved puzzle. Whereas reciprocity or conditional cooperation is one of the most prominent mechanisms proposed to explain the emergence of cooperation in social dilemmas, recent experimental findings on networked Prisoner's Dilemma games suggest that conditional cooperation also depends ...

A comparative analysis of spatial Prisoner's Dilemma experiments: Conditional cooperation and payoff irrelevance

We have carried out a comparative analysis of data collected in three experiments on Prisoner's Dilemmas on lattices available in the literature. We focus on the different ways in which the behavior of human subjects can be interpreted, in order to empirically narrow down the possibilities for behavioral rules. Among the proposed update dynamics, we find that the experiments do not provide sign...

Journal title:

Volume 7, Issue

Pages -

Publication date: 2017